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ABSTRACT Sequence divergence acts as a potent barrier 
to homologous recombination; much of this barrier derives 
from an antirecombination activity exerted by mismatch 
repair proteins. An inverted repeat assay system with recom- 
bination substrates ranging in identity from 74% to 100% has 
been used to define the relationship between sequence diver- 
gence and the rate of mitotic crossing-over in yeast. To 
elucidate the role of the mismatch repair machinery in 
regulating recombination between mismatched substrates, we 
performed experiments in both wild-type and mismatch re- 
pair defective strains. We find that a single mismatch is 
sufficient to inhibit recombination between otherwise identi- 
cal sequences, and that this inhibition is dependent on the 
mismatch repair system. Additional mismatches have a cu- 
mulative negative effect on the recombination rate. With 
sequence divergence of up to approximately 10%, the inhibi- 
tory effect of mismatches results mainly from antirecombi- 
nation activity of the mismatch repair system. With greater 
levels of divergence, recombination is inefficient even in the 
absence of mismatch repair activity. In both wild-type and 
mismatch repair defective strains, an approximate log-linear 
relationship is observed between the recombination rate and 
the level of sequence divergence. 



Recombination between sequences at identical (allelic) posi- 
tions on homologous chromosomes is important for mitotic 
DNA repair and for meiotic chromosome disjunction. In 
addition to the normal allelic interactions, eukaryotic genomes 
contain large numbers of repeated sequences that can poten- 
tially serve as substrates for nonallelic (ectopic) recombination 
events. Ectopic recombination can be either reciprocal or 
nonreciprocal in nature and can lead to alterations in genome 
structure. Nonreciprocal recombination events (gene conver- 
sions), which involve the unidirectional transfer of information 
from one repeat to another, result in either the homogeniza- 
tion of repeats or the generation of novel sequences. Recip- 
rocal recombination events (crossovers) involve the physical 
exchange of information between repeats and give rise to 
various types of genome rearrangements (e.g., duplications, 
deletions, inversions, and translocations). Given the genome- 
destabilizing effects of ectopic recombination, it is important 
that such events occur at low rates relative to allelic recom- 
bination. 

Two physical features of repeated sequences may be impor- 
tant for limiting potentially deleterious interactions (1): the 
total length of the repeats (substrate size) and the degree of 
sequence identity between the repeats (substrate homology). 
Studies in both prokaryotic systems (2, 3) and eukaryotic 
systems (4-8) have shown that substrate size does influence 
the efficiency of recombination. Similarly, studies done in a 
variety of organisms have shown that sequence divergence can 
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be a potent barrier to homologous recombination (9-23). The 
molecular basis of this barrier is not entirely understood, but 
it is presumably related to the formation and/or removal of 
mismatches in recombination intermediates (24, 25). Prokary- 
otic studies have demonstrated that the recombination barrier 
afforded by sequence divergence derives largely from antire- 
combination activity of the replication-associated mismatch 
repair machinery (10, 11, 13, 14). 

The best characterized mismatch repair system is the methyl- 
directed mismatch repair system of Escherichia coli y which 
contains three key components (MutS, MutL, and MutH). 
MutS binds to mismatched bases; MutH incises the newly 
replicated unmethylated DNA strand at hemi-methylated dam 
sites; and MutL functions as a molecular matchmaker to bring 
MutS and MutH together (26). Eukaryotes possess multiple 
MutS and MutL homologs (no MutH homologs have been 
identified), which have attracted attention because of their role 
in preventing the accumulation of cancer-promoting mutations 
(27). In the yeast Saccharomyces cerevisiae six MutS homologs 
(Mshlp-6p) and four MutL homologs (Pmslp, Mlhlp-3p) have 
been identified. Msh2p, Msh3p, Msh6p, Pmslp, and Mlhlp are 
involved in correcting DNA replication errors; strains defec- 
tive for any of these proteins have a mutator phenotype (28, 
29). In addition to editing the products of DNA replication, 
there is evidence that these proteins also regulate recombina- 
tion between mismatched DNA substrates in yeast (17-19, 22). 
Although elimination of MutS and MutL homologs can yield 
comparable mutator phenotypes, MutS homologs appear to 
play a greater role than MutL homologs in regulating recom- 
bination between diverged sequences (17, 18). 

We described previously an intron-based recombination 
system that can be used to examine reciprocal recombination 
between diverged sequences in yeast (17). This system was 
used to measure mitotic crossing-over between identical 
(100%) sequences, 91%-identical sequences, and 77%- 
identical sequences in both wild-type and mismatch repair 
(MMR) defective strains. Recombination between the 91%- 
and the 77%-identical sequences was very inefficient relative 
to recombination between 100% substrates in wild-type 
strains. In MMR-defective strains, however, recombination 
between the 91% substrates improved dramatically and was 
almost as efficient as that between 100% substrates; recom- 
bination between the 77% substrates remained inefficient in 
MMR-defective strains. These results indicate that the MMR 
machinery imposes the major barrier to recombination be- 
tween sequences of low divergence; at higher levels of diver- 
gence, the sequences themselves may impose a barrier to the 
recombination machinery. In the work reported here, a large 
number of recombination substrates exhibiting a wide range of 
sequence divergence have been used to determine the number 
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of mismatches necessary to trigger the antirecombination 
activity of the yeast MMR machinery and to define more 
precisely the distinct roles of DNA sequence divergence and 
mismatch repair activity in limiting genomic rearrangements. 

MATERIALS AND METHODS 

Media and Growth Conditions. S. cerevisiae strains were 
grown at 30°C. Yeast extract/peptone (YEP) medium (1% 
yeast extract/2% Bacto-peptone; 2.5% agar for plates) sup- 
plemented with 2% glycerol and 4% galactose (YEPGG) or 
2% dextrose (YEPD) was used for nonselective growth. Syn- 
thetic complete medium (30) supplemented with 2% glycerol 
and 4% galactose but deficient in histidine (SGG-his) was used 
to select for prototrophs in the His + rate measurement exper- 
iments. 

Plasmid Constructions. Plasmid pSR266 contains the 
pGAL'HIS3vintron construct (for details, see ref. 17) and was 
used as the starting point for constructing chicken j3-tubulin 
(cj3) inverted repeat substrates according to the general 
scheme shown in Fig. 1. Appropriate 350 bp segments of 
cDNAs encoding cfi Tsoforms were amplified by PGR using as 
template plasmids obtained from the laboratory of D. Cleve- 
land (31-34). PCR was performed either in the presence of 
1.5-3.0 mM MgCl 2 /0 mM MnCi 2 (high fidelity conditions) or 
0.5-1.5 mM MgCl 2 /0.25-0.75 mM MnCl 2 (low fidelity condi- 
tions; see ref. 35). Amplification primers contained restriction 
endonuclease sites near the 5' ends; the forward primer added 
Sail and BgUl sites to the amplified DNA while the reverse 
primer contained a BamHl site. The amplified product was 
digested with Sail and Bam HI and was inserted into Sail/ 
item Hi-digested pBluescript SK, generating pBluescript-cj3 
plasmids. The 1-bp changes in the c)32 substrates were intro- 
duced by site-directed mutagenesis of the c/32a 3' cassette 
plasmid using the Chameleon kit (Stratagene). Other se- 
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Fig. 1. Construction of inverted repeat substrates. The 
pGAL-HlS3\\intron construct contained on plasmid pSR266 is shown 
at the top. Open boxes correspond to HIS3 sequences, solid boxes to 
artificial i citron sequences, and cross-hatched boxes to c/3 sequences; 
boxes are not to scale. Only those restriction sites relevant to the 
constructions are shown: Sal, Sail; Sma, Smai; Bam, BamHl; Spe, 
Spei; Not, Notl, Nae, Nael; Bgl, Bglll. 



quence changes were introduced as a result of PCR errors. The 
c/3 inserts were sequenced as appropriate. 

Yeast Strain Constructions. All strains used in this study 
were derived by LiAc transformation (36) of isogenic strains 
SJR231 (MATaade2-101achis3A200ura3~Nhe) and GCY121 
(MATa ade2~101 oc his3L200 ura3-Nhe msh2& msh3kvhisG\ 
see ref. 17). Plasmids containing the inverted repeat constructs 
were targeted to integrate at the URA3 locus by digesting DNA 
with Siul before transformation. Ura + transformants were 
analyzed by Southern blot analysis to identify those containing 
only a single copy of the plasmid integrated at URA3. 

Measuring Recombination Rates. Recombination rates 
were determined by the method of the median (37) as follows. 
Two-day-old colonies were excised from YEPD plates, inoc- 
ulated into 5 ml of YEPGG medium, and grown for 2 days on 
a roller drum. Cells were harvested, washed once with sterile 
H20» and resuspended in 1 ml H 2 0. Aliquots (100 of 
appropriately diluted (or undiluted) cells were plated on 
SGG-his selective medium to determine the number of His + 
recombinants per culture or on YEPD to assess the total 
number of viable cells per culture. His + colonies were counted 
on day 4 after selective plating. The wild -type and MMR- 
defective strains containing a given pair of substrates were 
grown and plated at the same time. Generally 4-6 independent 
cultures were plated on a given day, and a minimum of 10 
independent cultures (representing at least two independent 
platings) was used for each rate determination. Recombina- 
tion rates between different strains were statistically compared 
using the x 2 method described by Wierdl et al (38). 

Estimation of Minimal Efficient Processing Segment 
(MEPS). With a knowledge of the sequence divergence be- 
tween pairs of substrates, one can estimate the length of a 
MEPS (3) for recombination as being approximately equal to 
the slope of the regression line of In (recombination rate) 
versus sequence divergence (39), where the sequence diver- 
gence in the regression equation is expressed on a scale from 
zero (perfect identity) to one (no identity). This method is 
applicable to a variety of data sets as it does not require 
knowledge of the exact number of mismatches in any given 
segment. If the exact number of mismatches is known, a better 
approximation of the MEPS is given by L(l - e°), where L is 
the length of the substrates (350 bp in the experiments 
reported here) and a is the slope of the regression of ^(re- 
combination rate) versus the number of mismatches. 

RESULTS 

Intron-Based Recombination Assay System. The essential 
features of the intron-based recombination system used in this 
study are illustrated in Fig. 1; a detailed description of the 
system can be found in Datta et al. (17). The starting plasmid 
in all manipulations was an integrative URA3 vector contain- 
ing a HIS gene into which an artificial intron had been inserted 
(pGAL-HIS3\. intron). A plasmid containing a 5' recombina- 
tion cassette (the 5' end of HIS3, the 5' portion of the intron* 
and a recombination substrate) was constructed by replacing 
sequences downstream of the 5' intron splice consensus ele- 
ment with a 350-bp fragment derived from one of several c/3 
cDNAs (c/32, c/33, c/36, or c/37). A plasmid containing a 3' 
recombination cassette (a recombination substrate, the 3' 
portion of the intron, and the 3' end of HIS3) was constructed 
by replacing sequences upstream of the intron TACTAAC 
element with a 350-bp fragment derived from a c/3 cDNA. To 
combine 5' and 3' cassettes into a single plasmid, a fragment 
containing the 5' cassette was inserted in inverted orientation 
at the 3' end of the 3' cassette. Finally, a single copy of this 
plasmid was integrated at the URA3 locus on chromosome K 
Crossing-over between the 5' cassette c/3 sequences and the 3' 
cassette c/3 sequences inverts the region between them, placing 
the 5' and 3' parts of thcHIS3 gene in the same orientation and 
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reconstituting a functional c0-containing intron. Because the 
recombinant c)3 sequences are spliced out of the primary HIS3 
transcript* there are no functional constraints on the recom- 
bination products and the degree of sequence identity can be 
varied over a very broad range. It should be noted that this 
system detects only crossover events and will not detect simple 
gene conversions. 

Combinations of 5' and 3' cassettes containing c)8 sequences 
derived from cDNAs encoding different )3-tubulin isoforras 
were used to achieve variable amounts of sequence divergence. 
For example, a c02 5' cassette and a c)37 3' cassette were 
combined to generate 91%-identicaI substrates. In addition to 
directly using c)3 cDNA sequences, random mismatches were 
introduced by mutagenic PCR or specific mismatches were 
introduced by site-directed mutagenesis. As controls for the 
diverged sequences, 100%-identical substrates were con- 
structed by combining 5' and 3' cassettes containing the same 
c/3 sequences (e.g., c|32a/c/32a). The distributions of noniden- 
tical bases in the recombination substrates are shown in Fig. 2. 

Recombination Rates Between Mismatch-Containing Sub- 
strates. Our previous work (17) demonstrated that recombi- 
nation between sequences containing many mismatches is 
regulated by the MMR machinery, but provided no informa- 
tion concerning the number of mismatches that are necessary 
to trigger this antirecorabination role. To specifically address 
this issue, substrates containing one or a few mismatches were 
introduced into the genomes of both wild-type (MMR + ) and 
msh2Amsh3A (MMR~) strains. Given what is known about 
yeast MMR, a msh2kmsh3A double mutant should be devoid 
of all mismatch repair activity (28). Recombination rates 
between the substrates were inferred by measuring the rates of 
His + prototroph formation (Table 1). Five different substrate 
pairs of 100% identity were analyzed as controls: c/32a/c/32a, 
cj33a/c/33a, c/33b/c/33b, c/36/c06, and cj37a/c)37a. Each pair of 
100%-identical substrates recombined at a rate of approxi- 
mately 1 X 10 " ft in the MMR + background. For each substrate 
pair, the rate of recombination was elevated 2- to 4-fold in the 
MMR" strain relative to the MMR + strain. 

Five substrates containing a single base substitution were 
constructed to vary the type, sequence context, and position of 



Table 1. Rates of His + recombinants 
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Fig. 2. Alignments of recombination substrates derived from c0 
cDNA sequences. All sequences are approximately 350 bp. Each 
potential mismatch between a given pair of sequences is indicated by 
a vertical line. 
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c£ sequences were numbered as in the GenBank files; accession nos. 
for c02, c|33, c06, and c|37 are M11443, M14228, J02828, and X07011, 
respectively. c02a sequences coaslsted of nt 690-1038, c02b sequences 
of nt 657-1007, c03a of nt 1093-1441, cj33b of nt 1391-1741, c06 of nt 
429-779, cj37a of nt 450-800, and c07b of nt 781-1129. For substrates 
containing a single mismatch, the nature and position of the mismatch 
are indicated as a subscript The c02 sequence with three mismatches 
(c02-3mm) contains the following changes: A769T, A934G, and 
T942C The c03a sequence with four mismatches (c03a-4mm) con- 
tains the following changes: A1216G, A1228G, C1297T, and deletion 
of G1101. Two types of statistical analyses indicate that a 2-fold 
difference between recombination rates Is significant. First, the c02a/ 
c02a substrates were used as an internal control in most fluctuation 
experiments, which yielded 15 different, but very similar, rate mea- 
surements (all the data were pooled for the rate used in the table). 
Using these independent rate measurements, the mean rate Is 98 X 
10-* and the SD Is 18 X 10" 8 . Second, in addition to the high 
reproducibility of rate measurements, representative rates were sta- 
tistically compared: c06/c06, MMR + versus c06/c06, MMR"(^ = 
6.0, P < 0.05); C06/C06, MMR + versus c)37a/c07a, MMR + (^ « 10.7, 
P < 0.01); C06/C06, MMR" versus C07a/c07a, MMR-tf = 24, P < 
0.01), C03b/c03b, MMR + versus c03b/c03b, MMR" Ok 2 = 25.6, P < 
0.01); c03/cj83ri53ic, MMR + versus c03/c03ns3ic, MMR-tf = 22, 
P < 0.01); C03b/c03b, MMR + versus c03/c03ns3ic, MMR + (x 2 = 
22, P < 0.01); and C03b/c03b, MMR" versus c03/c03n53ic, 
MMR-tf - 0.18, P> 0.50). 

the potential mismatch. Four were derived from c)32a and one 
was derived from c/33b. For the c/32a substrates (nucleotides 
690-1038 of c/32; see legend to Table 1), an A to G transition 
was introduced at position 884, a G to C transversion at 
position 883, and either a G to A transition or a G to C 
transversion at position 827. For the c/33b substrates (nucle- 
otides 1391-1741 of c/33) a T to C transition was introduced at 
position 1531. The transitions result in either an A-C or G-T 
mismatch in heteroduplex recombination intermediates; the 
transversions yield either a G-G or C-C mismatch. All recom- 
bination-generated mismatches with the exception of the C-C 
mismatch should be recognized efficiently by the yeast MMR 
machinery (40). For each of the 5 one-mismatch substrates 
examined, there was a 10- to 20-fold elevation in the recom- 
bination rate in the MMR" strain relative to the MMR + strain. 
If one compares these rates to those obtained with the 100% 
control substrates, a single mismatch uniformly lowered the 
rate of recombination 3- to 4-fold in the MMR + background 
but had no effect on the recombination rate in the MMR" 
strains. We conclude that a single mismatch can trigger the 
antirecombination function of the MMR machinery in yeast 
and thereby impede homologous recombination. The MMR 
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system did not appear to discriminate between the potentially 
different types of mismatches used here. 

To examine the antirecombination effect of multiple mis* 
matches, substrates containing either three or four mismatches 
(c)32a-3mra and cj33a-4mm» respectively) were introduced into 
the MMR + and MMR" strains. For each of these substrates, 
there was a 30-fold elevation in the recombination rate in the 
MMR" strain relative to the MMR + strain. As with the 
one-mismatch substrates, a comparison to the 100% control 
substrates indicates that three or four mismatches impact 
recombination only in MMR-corapetent cells. 

We reported previously recombination rates between 91%- 
identical and 77%-identical substrates in wild-type and MMR- 
defective strains (17). To examine more systematically recom- 
bination rates between sequences in the 95% to 75% identity 
range, additional substrates with identity levels of 94%, 85%, 
82%, and 74% were constructed. The 91% substrates (30 
mismatches) used in the previous study were derived by 
combining c|32a and c/37b sequences; the 94% substrates (21 
mismatches) were obtained by low fidelity PCR using c)32a as 
a template. Although the 94% and 91% substrates recombined 
at similar rates in-MMR-competcnt cells, elimination -of-MMR 
elevated recombination between the 94% substrates 140-fold 
but elevated recombination between the 91% substrates only 
a 45-fold. This difference in behavior of the 94% versus 91% 
substrates may be related to the very different distributions of 
mismatches in these two substrate pairs (see Fig. 2). When 
compared with the 100% control substrates, 20-30 mismatches 
reduced recombination rates about 50-fold in MMR- 
competent cells. There was a relatively modest reduction in 
recombination rates (2- to 3-fold) associated with this level of 
divergence in MMR-defective cells. 

With the more highly diverged 85%- and 82%-identical 
substrates (52 and 64 total mismatches, respectively), there was 
an approximately 80-fold increase in recombination rates in 
the MMR" strains relative to the MMR + strains. If one 
compares these rates to those obtained with the 100% control 
substrates, however, the 200-fold decrease in recombination 
rates observed in the MMR + strains was not eliminated in the 
MMR" strains. Even in the MMR" strains this level of 
divergence reduced recombination rates approximately 10- 
fold. The trend observed with the 85%- and 82%-identical 
substrates was even more striking with the 74% substrates. 
When compared with the 100% control substrates, the recom- 
bination rate between the 74% substrates was reduced 4000- 
fold in MMR-competent cells. Although elimination of the 
MMR system improved recombination between the 74% 
substrates, the rate was still approximately 200- fold lower than 
that observed with the 100% control substrates. 

DISCUSSION 

An intron-based, inverted repeat assay was used to examine 
recombination between identical substrates and between sub- 
Table 2. Summary of recombination rates 



strates containing variable numbers of mismatches (Fig. 2). All 
substrates were derived from c/3 cDNAs and were approxi- 
mately 350 bp long. The substrate length was held constant to 
insure that recombination rates were affected only by the 
mismatches present. Because detection of recombination re- 
quires inversion of the region between the repeats (Fig. 1), this 
system only identifies reciprocal exchange events; simple gene 
conversions cannot be detected. To assess the role of the 
mismatch repair machinery in regulating recombination 
between nonidentical sequences, recombination rates were 
determined in both wild-type and MMR-defective 
(msh2kmsh3&) strains. Table 1 presents the recombination 
rates for each substrate pair and these data are summarized in 
Table 2. 

Recombination rates between five different pairs of identi- 
cal (100%) substrates were examined and these rates served as 
controls for subsequent experiments involving mismatched 
substrates. In MMR-competent strains, the rates varied from 
0.7 X 10" 6 for the cj36/cj36 substrates to 1.7 X 10" 6 for the 
cj37a/c)37a substrates. The slight but statistically significant 
(see legend to Table 1) variation in rate with substrates of the 
same size indicates that the actual sequence of bases may have 
subtle effects on recombination. Such subtleties could have 
their origins in either preferred sites for recombination- 
initiating lesions or preferred sites for resolving recombination 
intermediates. Sequence preference for resolution has indeed 
been observed in the cleavage of Holliday junctions in vitro by 
the bacterial RuvC protein (41). With each of the five 100% 
substrates analyzed, the recombination rate in the MMR- 
defective strain was elevated relative to that in the MMR + 
strain; the average increase was 2.7-fold. Although the MMR 
machinery would not be expected to inhibit recombination 
between identical sequences, we suggest that the MMR system 
may be detecting either intrastrand secondary structure or 
unpaired regions resulting from branch migration of Holliday 
junctions into the nonhomologous regions that flank the 
recombination substrates. Regardless of the molecular basis of 
the increased recombination in MMR" strains seen here, it is 
important to account for this increase when assessing the 
effects of mismatches on recombination rates. It should be 
noted that the inhibitory effect of MMR proteins on recom- 
bination seen in this study is in contrast to the small stimulatory 
role reported elsewhere (42); the conflicting results may 
reflect differences in the assay systems used. 

The effect of a single potential mismatch on recombination 
rate was determined using five different substrate pairs, each 
of which contained a defined base substitution. Although both 
the type and position of the base substitution were varied, all 
5 one-mismatch substrates recombined similarly. In the 
MMR + strains, the average rate of recombination between the 
mismatch-containing substrates was approximately 4-fold 
lower than the average recombination rate between the 100% 
substrates. In the MMR" strains, however, the average rate of 
recombination between the mismatch-containing substrates 
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For each homology level, the rates shown were obtained by averaging the rates observed with each 
substrate at that level (see Table 1). The number in parentheses following each rate represents the fold 
increase ( f ) or decrease ( j ) relative to the rate obtained with the 100% substrates in the same MMR + 
or MMR" background. 
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was the same as that between the 100% substrates. These data 
demonstrate that a single mismatch is sufficient to impair the 
recombination process in yeast and that this impairment 
derives solely from action of the MMR machinery. Although 
a single mismatch within 31 bp substrates also was observed to 
inhibit recombination in £. colt, the inhibition was not due to 
the action of the mismatch repair machinery (15). 

To more precisely define the impact of sequence divergence 
on mitotic recombination, additional substrate pairs with 
identities ranging from 99% (3 mismatches) to 74% (90 
mismatches) were examined. In MMR-corapetent cells there 
was a cumulative negative effect of mismatches on recombi- 
nation rates (Table 2). Whereas a single mismatch reduced 
recombination rates approximately 4-fold, the presence of two 
to three additional mismatches decreased recombination 9- 
fold relative to the 100% substrates. This trend continued in 
the MMR + strains, with the 91-94% identical substrates 
recombining 50-fold less efficiently, the 82-84% substrates 
recombining 300-fold less efficiently and the 74% substrates 
recombining 4600-fold less efficiently than the 100% control 
substrates. In striking contrast to MMR-competent strains, 
there was Httie effect of sequence divergence on recombina- 
tion rates in MMR-defective strains until the identity level 
dropped below the 91-94% range. This indicates that, with 
sequences of greater than 90% identity, all of the inhibitory 
effect of sequence divergence on recombination can be attrib- 
uted to action of the MMR machinery. Once sequences pass 
a divergence threshold of 5-10%, however, a factor other than 
the MMR system strongly impairs the recombination process. 
As suggested previously (17), we believe that this additional 
impairment likely reflects a general limitation of the yeast 
recombination machinery, which could correspond to an in- 
ability to efficiently initiate strand transfer or resolve inter- 
mediates as crossovers. Although the yeast MMR system 
imposes an antirecombination effect at all levels of sequence 
divergence examined, the maximum effect exerted by the 
MMR machinery appears to be about 30-fold (last column of 
Table 2). This level of inhibition is attained when the sequence 
divergence is in the 10-15% range. 

The relationship between sequence identity and recombi- 
nation rate in both MMR + and MMR" cells is graphically 
presented in Fig. 3. For each set of data, a roughly linear 
relationship is obtained if one plots the In (recombination rate) 
versus % sequence divergence. A similar log-linear relation- 
ship was observed in studies examining the effects of sequence 
divergence on the efficiency of bacterial conjugation and 
transformation (23, 39). A log-linear relationship between 
recombination rate and sequence divergence can be explained 
most simply by the concept of a MEPS, which is defined as the 
minimal length of perfect homology needed for efficient 
recombination. Shen and Huang (3) originally proposed 
MEPS to explain the linear relationship observed between 
recombination rate and substrate size in a bacterial recombi- 
nation assay. If one considers any piece of DNA as a linear 
series of overlapping MEPS, each of which recombines at the 
same efficiency, the MEPS concept postulates that the rate of 
recombination should be directly proportional to the number 
of MEPS in a particular sequence. Because the number of 
MEPS increases linearly with substrate length, the MEPS 
concept accounts for the observed linearity between substrate 
length and recombination rate. If the lengths of the substrates 
are held constant and random mismatches are introduced, the 
number of MEPS decreases exponentially as the sequence 
divergence increases. 

For substrates with a fixed length and varying numbers of 
mismatches, as in the data reported here, the length of the 
MEPS can be estimated according to the equation described 
in the Materials and Methods. This technique gives a MEPS of 
18 bp [14-21 bp 95% confidence interval (CI.)] for the MMR" 
strains, and 28 bp (23-33 bp 95% CI.) for the MMR + strains. 
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% Divergence 

Flo. 3. Recombination rate versus % sequence divergence. All the 
data in Table 1 are graphed. (Inset) Data for the 0-4 mismatch 
substrates only. Open and solid circles correspond to data from 
MMR + and MMR" strains, respectively. The curves were derived 
using the model described in the text. Nonlinear curves were fit using 
the simplex method of systat 5.0 (Macintosh). Different starting 
conditions for the nonlinear curve fitting program produced nearly 
5-fold variations in the fitted values of Ro and Po, which were 
statistically indistinguishable from each other; fitted values of the other 
parameters remained approximately constant with a range <1%. 

As alluded to above, however, visual examination of the 
MMR + data indicates a very rapid dropoff in crossover rates 
at low sequence divergence, followed by a leveling off to a slope 
similar to that observed with the MMR" data. This behavior 
suggests a model in which heteroduplex forms with a proba- 
bility that declines exponentially with sequence divergence; 
heteroduplex avoids triggering mismatch repair with a prob- 
ability that also falls off exponentially with sequence diver- 
gence (but at a different rate); and mismatch repair, if trig- 
gered, blocks recombination with a certain, fixed probability. 
In such a model, heteroduplex formation has probability 
Aye _otr ; mismatch repair is triggered with probability l-Rtf'P* 
and is effective (if triggered) with probability /. In these 
expressions, x is the divergence (measured on a scale of 0-1) 
between the recombining sequences; Po is the probability that 
identical substrates will form a heteroduplex; and R n is the 
probability that a heteroduplex between identical substrates, 
once formed, will avoid triggering mismatch repair. Within the 
framework of the MEPS concept, the values a and J3 corre- 
spond to the mismatch-free sequence length required to 
initiate heteroduplex formation and the mismatch-free se- 
quence length necessary to escape MMR activity, respectively. 
With these assumptions, the probability of a crossover event is 

Poe-^l ~f(\ - Riff*)]- 

The log-transformed version of above model was fit to the 
data for the MMR + strains, and the fitted model is shown as 
the solid curve in Fig. 3. The fitted values are a ~ 23, /3 = 610, 
/ = 0.97, P 0 = 5.1 X 10 -6 , and R {) = 0.18. It should be noted 
that Po and R$ are dependent on the substrate length, 350 bp 
in this case. Although the model is highly consistent with the 
data, this agreement should be viewed with the knowledge that 
it has four degrees of freedom and was devised after viewing 
the data. If the model is correct, it should predict the outcome 
of the MMR" experiments using the same parameters esti- 
mated from the MMR + data, but setting / = 0 (mismatch 
repair is never effective). This prediction, shown as the dashed 
line in Fig. 3, agrees well with the MMR" data. Taken 
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together, these data suggest that **20 bp of perfect homology 
are needed to initiate heteroduplex formation and that **610 
bp of perfect homology are needed to avoid the antirecom- 
bination activity of the MMR machinery. Because the latter 
estimate is longer than the length of the recombination 
substrates used here, it can account for the increases in the 
rates of recombination between 100%-identical substrates that 
accompanied elimination of MMR. 

The model above is based on the number of mismatches 
rather than on the locations of the mismatches within the 
recombination substrates. With knowledge of the precise 
positions of the mismatches in the substrate pairs (see Fig. 2), 
the number of mismatch-free stretches of any given length can 
be determined. If the MEPS concept is applicable in its 
simplest form to yeast recombination, there should be a MEPS 
length that yields a linear relationship between the number of 
MEPS and crossover rates, at least in the MMR" strains in 
which only a single process (heteroduplex formation) may be 
limiting. When this is attempted with the MMR" data, even 
the best fit (MEPS = 11 bp) is poor (r 2 = 0.62), suggesting that 
factors other than the length of mismatch-free sequence (e.g., 
base composition or preferred initiation/resolution' sites) af- 
fect recombination rate. The observation that different pairs of 
100%-identical substrates recorabined at slightly different 
rates supports the notion that recombination is influenced by 
factors other than substrate length. It is interesting that despite 
the existence of these other factors, the fit of the model used 
above to the data is quite good (Fig. 3). 

The assay used in this study depends on the resolution of 
recombination intermediates as crossover events, and so does not 
provide any information concerning when or how MMR proteins 
exert their antirecombination role. The regulation could involve 
some type of steric hindrance resulting from mismatch binding or 
it could involve the actual destruction of mismatched heterodu- 
plex. Although the destruction of intermediates is easy to imagine 
in those organisms where a nicking activity has been associated 
with the MMR machinery (e.g., MutH in£. coli\ no comparable 
nicking activity has been identified in yeast. In relation to the issue 
of when the antirecombination role of the MMR machinery may 
be exerted, it should be noted that some assay systems have failed 
to detect any impact of the MMR machinery on recombination 
between diverged sequences (21, 43). In contrast to the intrach- 
romatid, inverted repeat crossing-over assay used here, one 
system examined plasmid-chroraosome gene conversion (43) and 
the other system examined intrachromatid interactions between 
direct repeats (21). We do not think that our observations are an 
artifact of using an intrachromatid inverted repeat assay because 
essentially the same effects have been observed with nonidentical 
substrates positioned on nonhomologous chromosomes (W. 
Chen and S.J.-R., unpublished data). Although it is not obvious 
why the MMR system impacts various recombination systems so 
differently, it is nevertheless clear the MMR system can exhibit 
potent antirecombination activity. It is particularly striking that a 
single mismatch can trigger the antirecombination activity of the 
yeast mismatch repair machinery. If one extrapolates the results 
reported here to higher eukaryotes, the accumulation of chro- 
mosomal rearrangements via recombination between diverged 
sequence elements may contribute to the genetic instability of 
MMR-defective cells and may be important for tumor progres- 
sion. 
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